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Massive stars are very rare, but their extreme luminosities make them both 
the only type of young star we can observe in distant galaxies and the dominant 
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energy sources in the universe today. They form rarely because efficient radiative 
cooling keeps most star-forming gas clouds close to isothermal as they collapse, 



fSj ' and this favors fragmentation into stars < 1 M Q (solar mass; ref. 1_3 ). Heating 
q ■ of a cloud by accreting low-mass stars within it can prevent fragmentation and 
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allow formation of massive stars 4 ' 5 , but what properties a cloud must have to 
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form massive stars, and thus where massive stars form in a galaxy, has not 
yet been determined. Here we show that only clouds with column densities 
^1 g cm -2 can avoid fragmentation and form massive stars. This threshold, 
and the environmental variation of the stellar initial mass function (IMF) that 
it implies, naturally explain the characteristic column densities of massive star 
clusters 6-9 and the difference between the radial profiles of Ha and UV emission 
in galactic disks 10 ' 11 . The existence of a threshold also implies that there should 



be detectable variations in the IMF with environment within the Galaxy and 
in the characteristic column densities of massive star clusters between galaxies, 
and that star formation rates in some galactic environments may have been 
systematically underestimated. 

Consider a simple model system: a spherical gas cloud of mass M, column density E, 
radius R = \J Mj (7rE), and density profile p oc r~ kp , with a point source of luminosity L 
at its center, representing the radiation output by stars beginning to form within it. In the 
limit L — > 0, the cloud falls to a background temperature T b set by the balance between 
cosmic ray heating and molecular and dust cooling. We are interested in the earliest stages 
of cloud collapse, so we adopt k p — 1. This puts most of the mass at low density, and is 
expected if clouds are in rough hydrostatic balance and obey the observed molecular cloud 
linewidth-size relation 12 a oc r q , where a is the velocity dispersion, r is the size scale, and 
q ~ 0.5. However, any choice in the range 1 < k p < 2 yields the same qualitative conclusions. 

The dust in a spherical cloud with a central source of illumination has a powerlaw 
temperature structure T = T ch (r / R c h)~ kT , where T ch , R ch , and &t are functions of the 
cloud column density S, the light to mass ratio r\ = L/M, and the dust opacity, which we 
characterize through a parameter S that we define below 13 . As we show in the Supplementary 
Information using a grain-gas energy exchange code 14-16 , at the high densities with which 
we are concerned, the gas temperature will be nearly identical to the dust temperature. The 
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temperature will be everywhere greater than Tj-, if 



T ch {v, £,<J) 



R 



-fc T (r?,S,(5) 



1) 



_-R ch (?7, 

Since fcx is generally close to 0.4 for strong sources of internal illumination and large R/R c h, 
a cloud satisfying this condition has an effective adiabatic index 7 w 1.4 throughout its 
volume. Since even 7 pa 1.1 — 1.2 is sufficient to suppress fragmentation 5 , equation (JT]) 
implicitly defines a critical light to mass ratio //halt above which fragmentation will halt in a 
cloud with a given £, 5, and Tt,. We describe our procedure for solving this equation in the 
Supplementary Information. 

We approximate the infrared dust opacity as k = <5/to(Ao/A) 2 , where 5 is a dimensionless 
number that we define to be unity at solar metallicity, A is the radiation wavelength, and 
Ao = 100 /iin. Observations in the Milky Way indicate that, in cold regions where dust grains 
are coated with ice mantles, k,q pa 0.54 cm 2 g _1 (ref. 13,17 ). Under Milky Way conditions 
the minimum temperature for interstellar gas is pa 10 K, with a weak density- dependence 
that we ignore for simplicity. In addition to the Milky Way case, we also consider 5 = 0.25, 
Tb = 10 K, appropriate for in a low metallicity galaxy today, and 5 = 0.25, = 15 K, 
typical of a galaxy at z ~ 6 that has low metallicity but a temperature floor of 15 K imposed 
by the cosmic microwave background (CMB). Figure [1] shows the value of //halt computed for 
the three cases. We find that //halt declines with S because at higher S a cloud of fixed mass 
has a smaller radiating area and remains warmer at fixed luminosity. 

Clouds containing massive stars can have light-to- mass ratios of 100 L & /M Q 18 , more 
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than sufficient to stop fragmentation, but we are interested in clouds where no massive 
stars have yet formed because fragmentation breaks all collapsing objects down to small 
masses. For a low-mass protostar the dominant energy source is gravitational potential 
energy radiated away by accreting gas. We plot the energy released per unit mass accreted, 
ip, in Figure [2j Consider a cloud converting its mass into stars at a rate M* with a mass 
distribution dn/dhim* and a mean mass m* = J m* (dn/d In m* ) d In m*. Once the rate at 
which new stars in a cloud begin accreting balances the rate at which other stars reach their 
final mass and stop accreting, the light-to-mass ratio is 

^ = ^{^)ld^:^ dlnm * (2) 

= ^y^WiMF> ( 3 ) 
where SFRff = (M*tg)/M is the fraction of a cloud's mass that it turns into stars per 
mean density-free-fall time tg, and (V^imf = ™^ 1 /(^ n /^l nm *)'/' m *^l nm * is the IMF- 
averaged value of ip. For a Chabrier IMF 19 truncated at a maximum mass of 1 M , 
(^)imf = 2-1 x 10 14 erg g _1 = 0.11(GMq/Rq). Observations constrain SFR ff to be a few 
percent 20 ' 21 , and in the Supplementary Information we use an analytic fitting formula 22 to 
estimate SFR ff « 0.041(M 2 S /T b 2 1 )- a08 , where M 2 = M/(100 M ), S = E/(l g cm" 2 ), 
and T b l = T b /(10 K). Combining our estimates for (ip) mF and SFR ff with the definition of 



the mean density free-fall time (tg = \JS7r/ (32Gp) = 38.6 M^Xq 3 ^ 4 kyr), we find that the 
light-to-mass ratio of a cloud powered by accretion onto low-mass stars is 

%rav -3.6M 2 - - 33 Sr% ; 16 (^). (4) 
For the models shown in Figure [2] a cloud reaches its equilibrium light-to-mass ratio ^ 3tg 



after star formation begins, and since SFRg ^ 0.05 at most ~ 15% of the mass will have 
gone into low-mass stars at this point. If star formation accelerates in time, as predicted by 
some models 23 , then ?7g rav will reach the value given by equation (TjJ even earlier. 

In Figure [fl we show ?7g rav computed for some typical parameters overplotted with 
^hait- For each cloud mass M, we solve equations ([T]) and (0J to find the column density S th 
such that r/hait > ?7grav, and plot the result in Figure [3j This is the threshold required for 
fragmentation to halt. We find that thresholds of 0.7 — 1.5 g cm -2 are required to form stars 
of 10 — 200 M & under Milky Way conditions. Lower metallicity galaxies with comparable 
background temperatures require column densities that are a factor of ~ 3 smaller, while 
galaxies at z « 6 with low metallicity but high CMB temperatures require higher column 
densities by a similar factor. 

The existence of a threshold for massive star formation both explains current obser- 
vations and predicts future ones. Regions with column densities ^ 1 g cm" 2 are rare even 
among star forming clouds, and contain only a small fraction of the molecular mass in the 
Galaxy, but our threshold explains why all nearby regions of massive star formation have S 
at or above this value 6-9 . We further predict that clusters formed with £ 1 g cm -2 should 
be deficient in massive stars. This is probably unobservable in individual low-column density 
clusters because they contain too few stars, but a statistical analysis of many clusters might 
reveal the effect. 

Conversely, suppression of fragmentation should produce top-heavy IMFs at high gas 
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column densities. This prediction can be tested by x-ray searches for low-mass protostars in 
high £ clouds that are not detected by Spitzer at 24 /im and therefore contain no massive 
protostars 24 . We predict that any low- mass protostellar populations detected will constitute 
at most 15% of the total mass. This prediction provides a sharp test for distinguishing our 
model from competitive accretion models, which predict that the mass of the most massive 
star forming in a cloud is related to the mass of low-mass stars around it by (M mass /M ) xs 
(M low „ mass /M ) 2 / 3 (ref. 23 ). Thus we would predict that a 100 M , £ > 1 g cm 2 cloud 
with no stars larger than 10 M should have a total stellar content below 15 M , while 
competitive accretion would allow up to 42 M of low-mass stars. However, since radiation 
does not halt fragmentation until some low-mass stars have formed, we do expect most 
massive stars to form surrounded by a cluster. 

Environmental variation at the top end of the IMF has even more profound conse- 
quences for extragalactic astronomy, since observations of distant galaxies are generally sen- 
sitive only to massive stars. The threshold explains why Ha emission in galactic disks ends at 
sharp edges where the disks transition from gravitationally unstable to gravitational stable 10 , 
but UV emission declines smoothly with radius and does not show a feature at the Ha edge 11 . 
Ha and UV are both tracers of recent star formation but are sensitive to different parts of 
the IMF. Outside the gravitational stability radius, the molecular-to-atomic surface den- 
sity ratio drops sharply 25 , probably because purely local instabilities create small molecular 
clouds but not giant complexes like those in disks' inner parts. Since compressing gas to 
column densities of E th requires a huge amount of weight that can only be provided by such 
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giant complexes, their absence will selectively suppress the formation of the most massive 
stars, leading to a truncated IMF. If in such a region no stars larger than, for example, 15 
M Q were to form, this would reduce UV emission by ~ 50% but would eliminate more than 
99% of the Ha light 26 , explaining the sharp drop in Ha but not UV. 

An important corollary to this point is that estimates of the star formation rate assum- 
ing a standard IMF in regions that do not contain giant clouds, such as much of the volume 
of dwarf galaxies and the outer parts of disk galaxies, may be systematically too low. At 
this point our theory is too approximate to allow a precise calculation of the extent of the 
underestimate. 

Our calculation of £ t h also enables us to predict how the characteristic column densities 
of young clusters containing massive stars will vary with Galactic environment. We predict 
that clusters that have cleared their gas but not yet dynamically expanded should show a 
minimum column density near E t h (probably slightly below £ t h due to gas removal), and 
this should be lower at low metallicity and higher at high redshift as indicated in Figure [3J 
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Figure 1: Critical and equilibrium light to mass ratios versus cloud column density. The 
plot shows the critical light to mass ratio r^ait (solid lines) and the equilibrium light to mass 
ratio 77 grav due to low mass star formation (dashed lines) as a function of the cloud column 
density £. The three curves for ?7 halt are computed for S — 1, T b = 10 K, for S — 1, T b = 15 
K, for 5 = 1/4, Tb = 10 K, and for 5 = 1/4, T b = 15 K, as indicated. The two sets of curves 
for 7/grav are computed for M = 20 M & and M = 200 M Q , as indicated, corresponding to the 
clouds that would be required to form 10 M e and 100 M & stars for a typical star-formation 
efficiency of 50%. 27 In each pair the lower curve corresponds to T b = 10 K and the upper 
to T b = 15 K. Note that the background temperature can be higher than our assumed T b 
in regions near massive stars, but it is unclear whether massive stars in a cluster ever form 
close enough in time so that the first to form can affect the formation of subsequent ones. 
In the Orion Nebula 28 and W3 Main 29 clusters, all the stars larger than 10 M that remain 
today formed over a time spread < 10 5 yr. This is comparable to the formation time of a 
single massive star 30 , so that the last massive star to form must have been well-along by the 
time the first began to heat its envelope. Even in non-coeval clusters, our approach applies 
to the first massive stars. 
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Figure 2: Energy per unit mass radiated and formation time versus protostellar mass. The 
plot shows the energy per unit mass ip radiated away in the process of forming a star of mass 
m* (a), and the time required to form the star tf (b). The tracks shown are computed using 
a one-zone protostellar evolution code 9 applied to the accretion histories predicted by refs. 
9 ' 30 using their fiducial parameters, for protostellar cores born in environments where the 
column density is £ = 0.1, 1.0, or 10.0 g cm~ 2 as indicated. However, alternative accretion 
histories give qualitatively identical results. Note that ip is nearly independent of both 
accretion history and final stellar mass because the entropy distribution within a protostar 
and the protostellar mass-radius relation are nearly constant on time scales short compared 
to the Kelvin-Helmholtz time £kh, and for low-mass stars tf <C £kh ~ 10 Myr. This means 
that ip, which is a measure of the gravitational energy released, is nearly independent of 
accretion history. Moreover, since low-mass stars have nearly linear mass-radius relations, 
ip is also nearly independent of the final stellar mass. Although our calculation of ip uses a 
code calibrated to solar metallicity, our results should also apply over a very wide range of 
metallicities because even for low metallicity stars tf <C £kh- 
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Figure 3: Threshold column density versus stellar mass. The plot shows the threshold 
column density E t h required to form a star of mass m* for 5 — 1, Tt, — 10 K, for 5 = 1/4, 
Tb = 10 K, and for 5 — 1/4, Tt, — 15 K, as indicated. In making the plot we assumed an 
efficiency of 50% 27 , so that the cloud mass required to make a star of mass m» is M = 2m*. 
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Supplementary Discussion 



In this Supplementary Discussion, we give more detailed derivations of several of the 
results in our letter. 

First, we describe in detail our method of calculating the temperature structure of our 
clouds and determining when they are dominated by internal illumination. As discussed in 



and density profile p oc r~ kp . The gas in the cloud is mixed with dust, which provides an 
opacity per unit mass k = S k,o(\q / X) 13 , where A is the radiation wavelength, Ao = 100 /xm, 
k = 0.54 cm 2 g -1 , (3 = 2, and the choice 5=1 corresponds to typical properties in cold gas 
in the Milky Way, where ice mantles on dust grains double the opacity compared to that in 
the diffuse interstellar medium 17 . For convenience we define T = hc/(k B \o) = 144 K. There 
is a source of luminosity L located at the cloud center, and we define 77 = L/M. 

To compute the cloud temperature, we first estimate the dust temperature using the 
method of ref. 13 , who show that the dust temperature profile in a cloud of this sort takes 
an approximately powerlaw form: 



The characteristic emission radius and emission temperature T ch are defined by two 
conditions: the total luminosity must be L = AttLR^osbTX, where L is a constant of order 
unity to be determined, and the optical depth from the cloud edge to _R c h must be unity for 



the main text, we consider a cloud of mass M, column density S, radius R = 





(1) 
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radiation whose wavelength is A = he/ (k B T ch ). Together these conditions imply 



R \f T] 



(3 - kp)K 



R ch \\4a SB Lj [4(k p -l)Tg\ 

_ (3 - kp)K 



where a = l/[2(3 + 4(k p — 1)]. The approximation 



(2) 
(3) 



L « 1.6i? Ui (4) 

reproduces numerical solutions of the transfer equation very accurately, so we adopt it. 
The temperature structure within the cloud takes an approximately powerlaw form, T = 
T c h{r I R)~~ kT , with k^ well- fit by the approximation 

0.48fc°- 05 O.lfc^- 5 

U ~ P__ j P_ rC\ 

~ R°- 02kl p 09 R°- 7k p 9 ' 

Note that the approximations in ref. 13 break down and the equations become singular 
as k p — > 1, but numerical solutions of the transfer equation show that the temperature 
profiles for k p = 1 and k p = 1.1 are nearly identical. Thus, we handle the case k p < 1.1 
by approximating it with k p = 1.1. Also note that equation (j4]) is a slightly different 
approximation than that given in ref. 13 . This approximation provides a somewhat better 
fit than the approximation given there (S. Chakrabarti, private communication, 2007). 

If the dust and gas temperatures are equal, it is straightforward to solve equation 
(1) in the main text, so we proceed by assuming that they are equal and then check that 
assumption. Equations ([2D - Q constitute three equations in the unknowns R c h, T ch , and 
L, which we may solve for specified values of S, rj, and k p . Together with equation ([5]), this 
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fully specifies the dust temperature profile in the cloud. To solve equation (1) in the main 
text for ?7hait) we simply fix 5 and E, and iterate to obtain the value of i] that satisfies the 
equation. 

Our second calculation in the Supplementary Discussion is a check of the assumption 
that the dust and gas temperatures are equal using both a simple analytic estimate and using 
a detailed numerical calculation of dust-gas energy exchange. We perform this check for the 
threshold column densities E th that we obtain under the assumption that the dust and gas 
temperatures are equal. In the vicinity of an illuminating source that heats the dust, the 
gas temperature will be determined by a competition between the dominant heating process, 
collisions with warm dust grains, and the dominant cooling process, molecular cooling. The 
former will be least effective and the latter most effective at the low density edge of a cloud, so 
we focus our attention at cloud edges, where the temperatures are near Tb and the densities 
are lowest. The volumetric gas heating rate by dust collisions is approximately 15 

(T d -T g ) erg cm" 3 s"\ (6) 

where riu is the number density of hydrogen nuclei, T g is the gas temperature, T d is the dust 
temperature, and <r d0 is the average dust cross section per baryon, normalized to the fiducial 
Milky Way value, 6.09 x 1CT 22 cm~ 2 . We make the simple assumption that the dust-grain 
cross section varies with metallicity in the same way as the total dust opacity, so we adopt 
<7d0 = 5. 

The dominant dust cooling process is molecular line emission, with CO dominating at 



T gd = 9.0 x lO-^nlTj Vdo 



0.8 exp 



75 K 
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lower densities and optical depths and other species taking over at higher densities. The 
exact cooling rate is a very complex function of density, temperature, and optical depth, 
but numerical radiative transfer and molecular excitation calculations 14 show that, for a gas 
temperature near T g = 10 K, the total cooling rate has a roughly constant value of 

Amoi ~ KT 27 5n H erg cm" 3 s _1 (7) 

for n H in the range 10 3 — 10 7 cm -3 . We have again taken the molecular cooling rate to be 
simply proportional to the metallicity. Adopting this cooling rate, and solving for the gas 
energy balance by setting r g d = A mo i, we find 

m m 3.5 x 10 5 

T d - T g « K. (8) 

Thus, the gas and dust temperature will be identical to within 1 K for densities n H ~ 3.5 x 10 5 
cm -3 , roughly independent of 5. 

We can now compare this to the minimum densities found in gas clouds at the critical 
column density. For a cloud of mass M and column density S, the number density at the 
cloud edge is 



where \x = 2.34 x 1CT 24 g is the mean mass per H nucleus for a gas of the standard cosmic 
abundance. For stars in the mass range 10 — 200 M , for the threshold column densities 
£ t h shown in Figure 3 of the main text, the minimum values of uh are 7.9 x 10 5 , 3.4 x 10 5 , 
and 3.8 x 10 6 cm" 3 for the cases 5 = 1 and T b = 10 K, 5 = 0.25 and T b = 10 K, and 
5 = 0.25 and T b = 15 K, respectively. These values suggest that the cloud volume densities 
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are high enough for our threshold cases that our assumption of dust-gas temperature equality 
is well-justified, although the case 5 = 0.25, T b = 10 K is perhaps marginal. 

We further check this approximation by computing the gas temperature using a detailed 
gas-temperature calculation code 15,16 . The code takes as input fixed spherically-symmetric 
density and dust temperature profiles, and computes the resulting gas temperature profile. It 
includes grain-gas energy exchange, cosmic ray heating, and line cooling from a large number 
of species, including an approximate treatment of radiative trapping effects in the Sobolev 
approximation. In the code, we set the cosmic ray ionization rate to its Milky Way value, 
we set the grain-gas energy exchange rate to the value given by equation and we scale 
the tabulated molecular cooling rate by S under the assumption that the molecular cooling 
rate is simply proportional to metallicity. We also assume that our clouds are embedded in 
an environment where the column density is 10% of E t h- 

Supplementary Figure 1 shows the cloud temperature profiles and the fractional differ- 
ence between the dust and gas temperatures for the cases 5 = 1 and T b = 10 K, 5 = 0.25 and 
Tb = 10 K, and 5 = 0.25 and Tb = 15 K. The cases shown are for clouds with mass M = 200 
Mq, which we approximate will produce 100 M Q stars if they do not fragment 27 , but different 
cloud masses do not produce qualitatively different results. As the plots show, the dust and 
gas temperatures agree to within a few percent even at the cloud edges. Thus, our assump- 
tion of dust-gas temperature equality is well-founded. In fact, the detailed calculation shows 
that our analytic approximation overestimates the grain-gas temperature difference. This 
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is likely because in our analytic estimate we neglected cosmic ray heating, which becomes 
comparable to the assumed cooling rate A mo i when the temperature is near Tj-,, and because 
our analytic estimate of A mo i is based on calculations for somewhat lower column density 
regions than the ones we are considering, and the cooling rate is higher at lower column 
density due to decreased line opacity. 

The third and final calculation we present in the Supplementary Discussion is a deriva- 
tion of the estimated total rate of star formation in our centrally condensed cloud. We base 
our estimate on a fit to the star formation rate in simulations of driven turbulent motion in 
periodic boxes 22 

SFR ff _ uniform « 0.073c^- 68 .M- - 32 , (10) 

where a vu - is the cloud virial ratio and A4 is its one-dimensional Mach number on scales 
comparable to the size of the entire cloud. The definition of the virial ratio is 

a *" = -GM-' (U) 

where c s is the cloud's isothermal sound speed, which we compute at T b . Following obser- 
vations, we adopt a v - ir = 1.3 as typical of star- forming clouds 9 , and this implies that 

„-I(^)*- M (*«f, 

where M 2 = M/(100 M Q ), S = E/(l g cm" 2 ), and T hil = T b /(10 K). In the numerical 
evaluation, we have taken the mean mass per particle of the gas to be 2.33m H , appropriate 
for a gas of the standard cosmic abundance. Using this value of M. in equation (flOj) gives 
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an estimate of SFRg on the outer scale of the cloud: 

SFR ff _ 0Ut » 0.034 {^^j ■ (13) 

This would be the value of SFRg if the cloud were uniform. For a centrally-concentrated 
cloud, if one assumes that equation (ITU]) applies locally everywhere within the cloud, the 
star formation rate is enhanced over SFRfj_ out by a factor of (3 — /c p ) 3 ' 2 /[2.3(2 — k p )] (ref. 
20 ). This enhancement occurs because the mass-averaged free-fall time is higher and the 
mass-averaged Mach number is lower for a centrally-concentrated cloud than for a uniform 
one. Applying this factor to SFRg^uniform for k p = 1, we arrive at our final estimate for SFRg 
for the cloud: 

SFR, « 0.041 f^)" 08 . (W) 
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Supplementary Figure 1: Dust temperature and dust-gas temperature difference in a gas 
cloud. The left panel shows the dust temperature as a function of radius computed for clouds 
of mass M = 200 M with critical column densities, for the cases 5 = 1 and T b = 10 K (solid 
lines), 5 = 0.25 and T b = 10 K (dotted lines), and 5 = 0.25 and T b = 15 K (dashed lines). 
We do not show the gas temperature profile separately because the curves so closely overlay 
one another as to be indistinguishable. In the right panel, we show the fractional difference 
in dust and gas temperatures (T d — T g )/T d as a function of radius. 
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